Picture for Xiang Zheng

Xiang Zheng

Unmasking Reasoning Processes: A Process-aware Benchmark for Evaluating Structural Mathematical Reasoning in LLMs

Add code
Jan 31, 2026
Viaarxiv icon

Just Ask: Curious Code Agents Reveal System Prompts in Frontier LLMs

Add code
Jan 29, 2026
Viaarxiv icon

BibAgent: An Agentic Framework for Traceable Miscitation Detection in Scientific Literature

Add code
Jan 12, 2026
Viaarxiv icon

AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models

Add code
Nov 15, 2025
Figure 1 for AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models
Figure 2 for AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models
Figure 3 for AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models
Figure 4 for AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models
Viaarxiv icon

Defense-to-Attack: Bypassing Weak Defenses Enables Stronger Jailbreaks in Vision-Language Models

Add code
Sep 16, 2025
Viaarxiv icon

GenBreak: Red Teaming Text-to-Image Generators Using Large Language Models

Add code
Jun 11, 2025
Viaarxiv icon

RedRFT: A Light-Weight Benchmark for Reinforcement Fine-Tuning-Based Red Teaming

Add code
Jun 04, 2025
Figure 1 for RedRFT: A Light-Weight Benchmark for Reinforcement Fine-Tuning-Based Red Teaming
Figure 2 for RedRFT: A Light-Weight Benchmark for Reinforcement Fine-Tuning-Based Red Teaming
Figure 3 for RedRFT: A Light-Weight Benchmark for Reinforcement Fine-Tuning-Based Red Teaming
Figure 4 for RedRFT: A Light-Weight Benchmark for Reinforcement Fine-Tuning-Based Red Teaming
Viaarxiv icon

PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking Attacks

Add code
May 22, 2025
Viaarxiv icon

Reinforced Diffuser for Red Teaming Large Vision-Language Models

Add code
Mar 08, 2025
Viaarxiv icon

SCORE: Saturated Consensus Relocalization in Semantic Line Maps

Add code
Mar 05, 2025
Figure 1 for SCORE: Saturated Consensus Relocalization in Semantic Line Maps
Figure 2 for SCORE: Saturated Consensus Relocalization in Semantic Line Maps
Figure 3 for SCORE: Saturated Consensus Relocalization in Semantic Line Maps
Figure 4 for SCORE: Saturated Consensus Relocalization in Semantic Line Maps
Viaarxiv icon